Extreme events on complex networks 
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We study the extreme events taking place on complex networks. The transport on networks is 
modelled using random walks and we compute the probability for the occurance and recurrence of 
extreme events on the network. We show that the nodes with smaller number of links are more prone 
to extreme events than the ones with larger number of links. We obtain analytical estimates and 
verify them with numerical simulations. They are shown to be robust even when random walkers 
follow shortest path on the network. The results suggest a revision of design principles and can be 
used as an input for designing the nodes of a network so as to smoothly handle an extreme event. 
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Extreme events(EE) taking place on the networks is 
a fairly common place experience. Traffic jams in roads 
and other transportation networks, web servers not re- 
sponding due to heavy load of web requests, floods in 
the network of rivers, power black outs due to tripping of 
power grids are some of the common examples of EE on 
networks. Such events can be thought of as an emergent 
phenomena due to the transport on the networks. As EE 
lead to losses ranging from financial and productivity to 
even of life and property [ij , it is important to estimate 
probabilities for the occurance of EE and, if possible, in- 
corporate them to design networks that can handle such 
EE. 

Transport phenomena on the networks have been stud- 
ied vigorously in the last several years 0, Q though they 
were not focussed on the analysis of EE. However, one 
kind of extreme event in the form of congestion has been 
widely investigated For instance, a typical approach 
is to define rules for (a) generation and transport of traf- 
fic on the network and (h) capacity of the nodes to service 
them. Thus, a node will experience congestion when its 
capacity to service the incoming 'packets' has been ex- 
ceeded Q. In this framework, several results on the sta- 
bility of networks, cascading failures to congestion tran- 
sition etc. have been obtained. Extreme event, on the 
other hand, is defined as exceedences above a prescribed 
quantile and is not necessarily related to the handling 
capacity of the node in question. It arises from natu- 
ral fluctuations in the traffic passing through a node and 
not due to constraints imposed by capacity. Thus, in rest 
of this paper, we discuss transport on the networks and 
analyse the probabilities for the occurance of EE arising 
in them without having to model the dynamical processes 
or prescribe capacity at each of the nodes. 

The transport model we adopt in this work is the ran- 
dom walk on complex networks [1]. Random walk is of 
fundamental importance in statistical physics though in 
real network settings many variants of random walk could 
be at work [6|. For instance, in the case of road traffic, the 
fiow typically follows a fixed, often shortest, path from 



node A to B and can be loosely termed deterministic. 
As we show in this paper, thresholds and corresponding 
probabilities for the EE depend on such details as the 
operating principle of the network. Thus, given the oper- 
ational principle of network dynamics, i.e., deterministic 
or probabilistic or a combination of both, can the nodes 
of the network be designed to have sufficient capacity 
to smoothly handle EE of certain magnitude? We show 
that we can obtain apriori estimates for the volume of 
transport on the nodes given the static parameters and 
operating principle of the network. Currently, for uni- 
variate time series, there is a widespread interest on the 
extreme value statistics and their properties, in particu- 
lar in systems that display long memory 7]. Thus, we 
place our results in the context of both the random walks 
and EE in a network setting. 

We consider a fully connected, undirected, finite net- 
work with N nodes with E edges. The links are described 
by an adjacency matrix A with whose elements Aij are 
either 1 or depending on whether i and j are connected 
by a link or not respectively. On this network, we have W 
non-interacting walkers performing the standard random 
walk. A random walker at time t sitting on ith node 
with Ki links can choose to hop to any of the neigh- 
bouring nodes with equal probability. Thus, transition 
probability for going from ith to jth node is Aij/K. We 
can write down a master equation for the rt— step transi- 
tion probability of a walker starting from node i at time 
n = to node j at time n as. 
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It can be shown that the n— step time-evolution operator 
corresponding to this transition, acting on an initial dis- 
tribution, leads to stationary distribution with eigenvalue 
unity Q and it turns out to be 
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The existence of stationary distribution is crucial for 
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defining EE. Physically, the time-independent probabil- 
ity in Eq. [2] implies that more walkers will visit a given 
node if it has more links. 

Now we can obtain the distribution of random walk- 
ers on a given node. We ask for the probability /(w) 
that there are w walkers on a given node having degree 
K. Since the random walkers are independent and non- 
interacting, the probability of encountering w walkers at 
a given node is while rest oi W — w walkers are dis- 
tributed on all the other nodes. This turns out to be 
binomial distribution given by 
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Now, the mean and variance for a given node can be 
explicitly written down as 
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Quite as expected, the mean and the variance depends 
on the degree of the node for fixed W and E. Note that 
K/2E << 1 and the relation between the mean and the 
variance for walkers passing through node can be written 
as (7 w {/y^^- This reproduces the relation proposed in 
Ref . [§] , later shown to have limited validity flO] . 

One natural extension of the result in Eq. [3] is to ac- 
count for fluctuations in the number of walkers. We as- 
sume that the total number of walkers is a random vari- 
able uniformly distributed in the interval [W — A, W+A]. 
Then the probability of finding w walkers becomes 
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where W = W — A. The mean and variance of this 
distribution can be obtained as, 
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In the spirit of extreme value statistics, an extreme 
event is one whose probability of occurance is small, typ- 
ically associated with the tail of the probability distribu- 
tion function. In the network setting, we will apply the 
same principle to each of the nodes. Based on Eqns[3]|4l 
we will designate an event to be extreme if more than q 
walkers traverse a given node at any time instant. No- 
tice that necessarily the cut-off q will have to depend on 
the node (or rather, the traffic flowing through the node) 
in question. Applying uniform threshold independent of 
the node will lead to some nodes always experiencing an 
extreme event while some others never encountering any 
extreme event at all. Hence we define the threshold for 
extreme event to be q = (/) -t- mcr, where m is any real 
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FIG. 1: Probability for the occurance of extreme events (EE) 
as a function of degree of a node for (a) A = 0, (b) A = 
O.OIVK and (c) A = O.IW. The threshold for EE is g = 
(/) -I- 4(7. The solid circles are obtained from simulations 
and the solid lines from analytical result in Eq. [S] All the 
numerical results shown in this paper are obtained with a 
scale-free network (degree exponent 7 = 2.2) with A = 5000 
nodes, E — 19815 vertices and W = 2E walkers averaged over 
100 realisations. Each realisation corresponds to a new set of 
randomly chosen initial conditions to begin the random walk. 



number. Then, the probability for extreme event can be 
obtained as 
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where [u\ is the floor function defined as the largest in- 
teger not greater than u. 

It does not seem possible to write this summation in 
closed form. However, for the special case when A = 
Eq. |5] simplifies to 

w 

F{K)^ E I{K)=I^{[q\+l,w-[q\) (9) 

k=yq\+l 

where Ip{-, ■) is the regularized incomplete Beta function 
ll| . For a given choice of network parameter E and num- 
ber of walkers W, the extreme event probability at any 
node depends only on its degree. In Fig[T]we show F{K) 
as a function of degree K superimposed on the results 
obtained from random walk simulations. The agreement 
between Eq. [T] and the simulated results is quite good. 
Further, each point in the figure represents an average 
over all the nodes with the same degree. We emphasise 
that the oscillations seen in Fig [T] are inherent in the an- 
alytical and numerical results and not due to insufficient 
ensemble averaging. 

An important feature of this result is that the nodes 
with smaller degree {K < 20) reveal, on an average, 
higher probability for the occurance of EE as compared 
to the nodes with higher degree, say, K > 100. By care- 
ful choice of parameters, the probability F{K) can differ 
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FIG. 2: (Color online) Probability for extreme events(EE) 
for several values of threshold q = (/) + ma. (a) shows 
the extreme event probabilities in log-log plot obtained from 
simulations with A = 0. while (b) shows Scaling for the 
same. So represents the reference slope with m = 2. The 
threshold applied for curves from top to bottom are m — 
2,2.5,3,3.5,4,4.5,5. 



by as much as an order of magnitude. This runs contrary 
to a naive expectation that higher degree nodes garner 
more traffic and hence are more prone to EE. While the 
former contention is still true in the random walk model 
we employ but the results here indicate that the latter 
one is not generally correct. As shown in Fig. [1] this 
feature is robust even when the number of walkers be- 
comes a fluctuating quantity. It must be pointed out 
that Eq. |S][3]for the extreme event probability does not 
depend on the parameters related to the topology of the 
network. Thus, even though the simulation results are 
shown for scale-free graphs, it holds good for other types 
of graphs (not shown here) with random and small world 
topologies. However, the difference in probabilty between 
higher and lower degree nodes is not pronounced in the 
case of random graphs. 

The threshold q that defines an event to be extreme 
depends on the traffic flowing through a given node. The 
choice q= if) + ma is arbitrary. Now, we show that the 
extreme event probability in Eq IH] scales with the choice 
of threshold q or, equivalently, m. In the Fig [DJa) we 
show Fm{K) for various choices of m in log- log scale. 
Clearly, as m decreases, ignoring the local fluctuations, 
the curves tend to become horizontal. Physically, this 
can be understood in the following way; q ^ Q implies 
that the threshold for EE decreases and this leads to 
larger number of EE and hence higher probability of oc- 
curance. In the limiting case of g — 0, all the events 
would be extreme and we see an equal probability of oc- 
curance of EE at all the nodes. The graph in Fig 
suggests that it might be scaling with respect to q or m. 
Starting from Eq. [HI we were not able to determine the 
scaling analytically. Hence, we empirically show that the 
following type of scaling relation holds for the probability 
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FIG. 3: (Color online) The inset shows the recurrence time 
distribution for extreme events from simulations (symbols) 
with A = for nodes with 5, 12 and 19 links. The solid 
line is the analytical distribution. The main figure shows the 
mean recurrence time as a function of degree K. 



of EE, 



F^{K) 



constant 



(10) 



where Fm{K) represents extreme event probability for 
threshold value q with parameter m. In this, Sm is the 
slope of the curves Fm{K) in the Fig. [2lja). In Fig. 
I^b), we show the effect of scaling for several choices of 
q. Using Eq. [TU] on the simulated data for A = 0, we 
flnd that all the curves for the probability of EE collapse 
into one curve to a good approximation. 

In the study of EE, distribution of their return inter- 
vals is an important quantity of interest. This carries 
the signature of the temporal correlations among the EE 
and is useful for hazard estimation in many areas. We 
focus on the return intervals for a given node of the net- 
work. Since the random walkers are non-interacting, the 
events on the node are uncorrelated. Then, the recur- 
rence time distribution is given by P(t) = e"^/<^>, where 
the mean recurrence time is (t) = \/F{K). In the in- 
set of Fig. [3j we show the recurrence time distribution 
obtained from random walk simulations for three nodes 
which have different degrees. In semi-log plot, they re- 
veal an excellent agreement with the analytical distribu- 
tion P{t) (shown as solid line). The main graph of Fig. 
|3] shows the mean recurrence time (r) , the only param- 
eter that characterises the recurrence distribution, as a 
function of the degree and it agrees with the analytical 
result. 

As pointed out before, many types of flow on the net- 
work, such as the information packets flowing through 
the network of routers and traffic on roads, use more 



intelligent routing algorithms 12| rather than perform- 
ing a random walk. In order to check the robustness of 
results in Eq. [BJSl we implemented the random walk sim- 
ulation with the constraint that the traffic from node i 
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FIG. 4: Extreme event probability Fsp with shortest path al- 
gorithm implemented for random walkers. The data is plotted 
in two different ways, (a) Fsp{b) as a function of betweenness 
centrality, (b) Fsp{K) as a function of degree K of the node. 
Nodes with same value of K can have different betweenness 
centrality. In (b), in order to reduce the clutter, for every 
value of K, the extreme event probability for the node with 
largest (solid circles, red) and least value (solid square, black) 
of b is plotted. The fluctuation parameter A = 0. 



to j takes the shortest path on the network. If multiple 
shortest paths are available to go from node i to j, the al- 
gorithm chooses any one of them with equal probability. 
Thus, in this setting, for every random choice of source- 
destination pair the paths are laid out by the algorithm 
and randomness arises only when multiplicity of shortest 
paths are available. In this sense, this can be thought 
of as walk with a large deterministic component. We 
used the shortest path algorithm developed in Ref. [1]. 
The simulation results, shown in Fig 2] as solid circles, 
are qualitatively similar to the trend displayed in Fig. [1] 
In this scenario of predominantly deterministic dynam- 
ics due to shortest paths constraint, it is conceivable that 
the degree of a node does not determine the flux passing 
through it. This role is played by the centrality of the 
node with respect to the shortest paths in the network 
and this is quantified by the betweenness centrality b of 
a given node [3]. Based on this qualitiative argument, 
the results in Fig. 2] can be understood if we replace Eq. 
[5] with p = f3b/B where B is normalisation factor that 
depends on the sum of betweeness centrality of all the 
nodes on the network. From the numerical simulations, 
we obtain /3 « 0.94. Using this p in Eq. [21 we can go 
through the same set of arguments as before and obtain 
(/), cr^, g and the probability for occurance of EE Fsp{b) 
analytically. In FigUJ^a) Fsp{b) is shown as solid curve. 
In Fig|4l[b) the same data for Fsp{b) is shown as a func- 
tion of K for easier comparison with Fig [TJ Thus, even 
with the shortest path algorithm thrown in, the extreme 
event probabilities are higher for the nodes with lower 
degree {K < 20) than for the ones with higher degree 
(K > 100). 



Finally we comment on how these results can be ap- 
plied as a basis to design nodes of a network. The central 
result in this paper in Eq. |5] allows us to apriori estimate 
the extreme event probabilities. These estimates depend 
on whether operating principle of dynamics can be mod- 
elled as a purely random walk or on the basis of shortest 
paths. If the idea is to avoid congestion or any other 
problems arising due to EE of certain magnitude, then 
these estimates can be used as an input to the design 
principles for the nodes. For instance, for the road traf- 
fic that operates broadly on the shortest path principle 
the probabilities can be used as an input to design prin- 
ciples (such as higher capacity to nodes) that will avoid 
bottlenecks arising from EE of a given magnitude. 

In scale-free networks, low degree nodes, which are 
more prone to experience the EE, form the bulk. But de- 
sign principles and practice generally focus on the hubs. 
The results in this work suggest that a revision from such 
an approach is necessary. A careful design for the capac- 
ity of low degree nodes needs to be given equal impor- 
tance. It must be emphasised that incorporating such 
extreme event estimates in design principles will only 
help in better preparedness to meet the expected ex- 
treme event. The extreme events discussed here being 
due to inherent fluctuations will nevertheless take place 
and cannot be avoided. 
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